
61 research outputs found

SMOClust: Synthetic Minority Oversampling based on Stream Clustering for Evolving Data Streams

Author: Chiu Chun Wai
Minku Leandro L.
Publication date: 28/08/2023

Many real-world data stream applications suffer not only from concept drift but also from class imbalance. Yet very few existing studies have investigated this joint challenge. Data difficulty factors, which have been shown to be key challenges in class imbalanced data streams, are not taken into account by existing approaches when learning class imbalanced data streams. In this work, we propose a drift adaptable oversampling strategy to synthesise minority class examples based on stream clustering. The motivation is that stream clustering methods continuously update themselves to reflect the characteristics of the current underlying concept, including data difficulty factors. This property can potentially be used to compress past information without explicitly caching data in memory. Based on the compressed information, synthetic examples can be created within the region that recently generated new minority class examples. Experiments with artificial and real-world data streams show that the proposed approach can handle concept drift involving different minority class decomposition better than existing approaches, especially when the data stream is severely class imbalanced and presents high proportions of safe and borderline minority class examples.

Comment: 59 pages, 85 figures

arXiv.org e-Print Archive

Resampling-Based Ensemble Methods for Online Class Imbalance Learning

Author: Minku Leandro L.
Wang Shuo
Yao Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/08/2014

Online class imbalance learning is a new learning problem that combines the challenges of both online learning and class imbalance learning. It deals with data streams having very skewed class distributions. This type of problem commonly exists in real-world applications, such as fault diagnosis of real-time control monitoring systems and intrusion detection in computer networks. In our earlier work, we defined class imbalance online, and proposed two learning algorithms, OOB and UOB, that build an ensemble model overcoming class imbalance in real time through resampling and time-decayed metrics. In this paper, we further improve the resampling strategy inside OOB and UOB, and look into their performance in both static and dynamic data streams. We give the first comprehensive analysis of class imbalance in data streams, in terms of data distributions, imbalance rates and changes in class imbalance status. We find that UOB is better at recognizing minority-class examples in static data streams, and OOB is more robust against dynamic changes in class imbalance status. The data distribution is a major factor affecting their performance. Based on the insight gained, we then propose two new ensemble methods that maintain both OOB and UOB with adaptive weights for final predictions, called WEOB1 and WEOB2. They are shown to possess the strength of OOB and UOB with good accuracy and robustness.

Crossref

Birmingham City University Open Access Repository

University of Birmingham Research Portal

BCU Open Access

Leicester Research Archive

Tackling virtual and real concept drifts: an adaptive Gaussian mixture model approach

Author: Minku Leandro
Oliveira Adriano L. I.
Oliveira Gustavo H.F.M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/07/2021

University of Birmingham Research Portal

GMM-VRD: a Gaussian mixture model for dealing with virtual and real concept drifts

Author: Minku Leandro
Oliveira Adriano L. I.
Oliveira Gustavo H.F.M.
Publication venue: IEEE Computer Society
Publication date: 30/09/2019

University of Birmingham Research Portal

Diversity-based pool of models for dealing with recurring concepts

Author: Chiu Chun Wai
Minku Leandro L.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/10/2018

University of Birmingham Research Portal

The potential benefit of relevance vector machine to software effort estimation

Author: Minku Leandro L.
Song Liyan
Yao Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 17/09/2014

University of Birmingham Research Portal

Software effort interval prediction via Bayesian inference and synthetic Bootstrap resampling

Author: Minku Leandro L.
Song Liyan
Yao Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/02/2019

University of Birmingham Research Portal

From flying warehouses to robot toilets - five technologies that could shape the future

Author: Minku Leandro L.
Reiff-Marganiec Stephan
Verdezoto Nervo
Publication venue: The Conversation Trust
Publication date: 27/07/2017

Online Research @ Cardiff

Transaction profile estimation of queueing network models for IT systems using a search-based technique

Author: Harman Mark
Islam Syed
Jia Yue
Minku Leandro L.
Sarro Federica
Srivisut Komsan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Peer-reviewed. The software and hardware systems required to deliver modern Web-based services are becoming increasingly complex. Management and evolution of the systems require periodic analysis of performance and capacity to maintain quality of service and maximise efficient use of resources. In this work we present a method that uses a repeated local search technique to improve the accuracy of modelling such systems while also reducing the complexity and time required to perform this task. The accuracy of the model derived from the search-based approach is validated by extrapolating the performance to multiple load levels, which enables system capacity and performance to be planned and managed more efficiently.

University of Limerick Institutional Repository

Crossref

University of Birmingham Research Portal

Irish Universities